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® Process for the production of D-aipha-amino acids. 



@ A process is described for the production of D-a-amino acids by the stereospecific conversion of racemic 
mixtures of 5-substituted hydantoins. where said conversion is earned out in the presence of a microorganism 

2 transfonned with a plasmid capable of expressing at high levels and without inducers an enzymatic system 

" capable of converting said hydandoins in the corresponding D-a-amino acids. 

U> A plasmid is also described comprising the genes which encode said enzymatic system and a microorgan- 
OO ism selected from Escherichia coli or Bacillus subtilis transformed with said plasmid. 

D-o-amino acids are intermediates useful in the preparation of pharmacologically active substances, 
rs pesticides and sweeteners. 
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The present invention relates to a process for the production of D-a-amIno acids by the stereospecific 
conversion of racemtc mixtures of 5-substituted hydantoins witfi a microorganism transformed with a 
plasmid capable of espressing in high yields and without inducers an enzymatic system capable of directly 
converting said hydantoins into the corresponding D-a-amino acids. 
5 The term enzymatic system refers to a system consisting of D-hydantoinase and D-N-carbamylase 
enzymes. 

[>na-amino acids are extremely valuable compounds useful for the preparation of phanmacologically 
active substances (for example, D-phenylglycine and D-parahydroxyphenylglycine are used in the synthesis 
of penicillins and cephalosporins), pesticides (D-valine for the synthesis of the insecticide fluvanilate) or 
10 sweeteners (D-alanine). 

The preparation of these enzymatic systems, however, requires the use of efficient inducers capable of 
stimulating the production of these enzymes on the part of the microorganisms. It is. in fact, known that the 
expression level of the enzymes D-hydantoinase and D-N-carbamylase is constitutively very low (Syldatk et 
al. (1990), "Advances in Biochem. Engineering/Biotechnology (Fiechter. A. Ed.), 41. pages 29-75, Springer- 
75 Verlag, Berlin). 

The inducers normally used are derivatives of hydantoins or n'rtrogenated cyclic compounds which are 
howevery easily metabolized by the microorganisms, or compounds such as uracil or thio-2-uracil or 
thymine which are not metatxDiized (Meyer et al., (1993), Fems Microbiol. Letters, 1^: 67-74). 

The use of inducers creates a series of drawt)acks among which an increase in the production costs 
20 and a certain variat>ility in the production yields of the enzymes. In addition, the expression level which can 
be obtained in most of the microorganisms following induction is insufficient for economical use in industrial 
processes (Syldattc et al. (1987). Biotechnol. lett., 9: 25-30; Yokozeki et al. (1987) Agric. Biol. Chem., 51, 
715-722). 

Recentiy the genes which encode the enzymes D-hydantoinase and D-N-cart>amylase have been 
25 individually sequenced and cloned (US 4.912.044 and EP-51 5-698). 

More specifically, patent US 4.912.044 describes the preparation of D-hydantoinase by the fermentation 
of a microorganism transformed with a hybrid vector containing the hydantoinase gene whose expression is 
induced by temperature variation. The enzyme thus obtained is used for the production of D-N-carbamyl 
derivatives from 5-substituted hydantoins. 
30 Patent application EP-51 5.698 describes, on tfie other hand, tiie preparation of D-N-carbamylase by the 
fermentation of a microorganism transformed with a plasmid comprising the carbamylase gene whose 
expression is chemically Induced with IPTG. The enzyme thus obtained is used for the production of D-a- 
amino acids from N-carl)amyl derivatives. 

As industrial interest is directed towards the conversion of racemic hydantoins to D-a-amino acids, the 
35 fact that the two enzymes are expressed in different strains involves tfie use of both and consequentiy the 
development of a process starting from two distinct fennnentative processes. 

This obviously increases the production costs and reduces the conversion kinetics. In fact. In order to 
complete the enzymatic reaction, the N-cart>amyl derivative produced by tfie transformed microorganism 
containing the hydantoinase must pass through the bacterial membrane, spread into the reaction medium 
40 and then proceed in the opposite direction to reach the second enzyme (carbamylase) present in the other 
strain. All ttiis is particulariy penalizing from the point of view of kinetics considering the reduced 
permeability of the bacterial membranes to the carbamyl derivatives (Olivieri et al. (1981), Biotechnol. 
Bioeng., 23, 2173-2183) and the inevitable dilution of the caribamyl itself in tiie reaction mixture. 

Rnally, the use of a double volume of bkxnass has a negative influence on tfie yields and degree of 
45 purity of the final product. 

In addition, the necessity of having to induce the expression of these enzymes creates a furtfier 
problem thus making these processes of Irttie interest for practical use. 

The object of the present invention is to overcome tiie disadvantages of the known art described above. 
In particular It has now been found, In accordance with the present Invention, that the use of a particular 
50 plasmid which contains the genes of D-hydantoinase and N-carit>amy)ase put under the control of an 
appropriate synthetic promoter, enables the high expression of these enzymes to be obtained without 
inducers. 

It is therefore possible to prepare a single mk:roorganism transformed with said plasmid containing the 
two enzymatic activities inside. This solution solves not only the problems relating to kinetics due to the 
55 limited permeability, as the two reactions occur Inside the same cell where the concentration of the 
substrates is excellent, but also those relating to tiie requirement of inducers and treatment of tiie product 
and of the waste products. 
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In accordance with this, a first aspect of the present invention relates to a process for the production of 
D-a-amino acids by the stereospecific conversion of racemic mixtures of 5-substituted hydantoins char- 
acterized in that, the conversion reaction is carried out in the presence of a microorganism transformed with 
a plasmid capable of expressing at high levels and without inducers an enzymatic system capable of 
5 converting said hydantoins into the corresponding D-a-amino acids. 

A furtfier object of the present invention is the plasmid pSM651 comprising the genes which encode the 
enzymatic system. 

Yet another object of the present invention is a microorganism transfored with the plasmid pSM651 
capable of expressing with high efficiency and without inducers an enzymatic system capable of 
70 stereospecifically converting racemic mixtures of 5-substituted hydantoins into the corresponding D-a-amino 
acids. 

A further object of the present invention relates to the use of said microorganisms or enzymatic system 
isolated from said microorganisms for the production of [>a-amlno acids by the stereospecific conversion of 
racemic mixtures of 5-sut>stituted hydantoins. 
75 Further objects of the present invention will be evident from the description and examples below. 

Brief description of the figures 

Rgure 1 : Map of the plasmid pSM637 containing the carbamylase gene 
20 Figure 2 : Map of the plasmid pSM650 containing the hydantoinase gene 

Figure 3 : Map of the plasmid pSM651 containing the hydantoinase-carbamylase operon. 
Figure 4 A-B : Nucleotide and amino acid sequence of cart>amylase. 
Rgure 5 A-C : Nucleotide and amino acid sequence of hydantoinase. 

Figure 6 : SDS-PAGE (A) and Westem-Blot (B) of the total proteins extracted from cultures of E.coli and 
25 B.sutytilis transformed with the plasmid pSM651 wherein: 

line 1 : standard hydantoinase 

line 2: standard carbamylase 

line 3: E.coli (pSM671 ) control 

line 4: E.coli SMC305 
30 line 5: B.subtilis (pSM671) control 

line 6: B.subtilis SMS373 

Tlie genes which encode the E>hydantoinase and D-N-carbamylase enzymes can be isolated from 
microorganisms such as Pseudomonas, Hansenula. Agrobacterium. Aerobacter. Aeromonas. Bacillus, 
Moraxella. Brevibacterium. Flavobacterium. Senratia, Micrococcus. Arthrobacter or Paracoccus. Specific 

35 examples of these microorganisms are Bacillus macroides ATCC 12905, Aerot^acter cloacae lAM 1221, 
Agrobacterium sp. IP 1-671, Agrobacterium radiobacter NRRLB 11291, Pseudomonas sp. FERM BP 1900. 

The isolation of the genes which encode the D-hydantoinase and D-N-cart)amylase enzymes can be 
carried out by the constiuction of a gene library, representing fhe genome of the microorganism, 
identification of the clones containing the genes which encode said enzymes, analysis of the gene 

40 sequence, insertion of said genes into a vector and control of their expression. 

The term gene library or genome bank means the combination of clones of a given host microorganism 
each of which carries a fragment of the chromosomal DNA deriving from the donor organism of which the 
t>ank is to be obtained. A t>ank is defined as being representative when the combination of the single 
fragments contained in each clone forms the majority of the chromosomal DNA of the donor organism. 

45 According to a preferred embodiment of the process of the present invention, the strain A.radiobacter 
NRRL B-11291 is used as donor organism for the isolation of the genes which encode D-hydarttoinase and 
D-N-cart)amylase. 

In practice, two genome banks of said mk:roorganism are constructed in E.coli by digesting the 
chromosomal DNA separately with the restriction enzymes Bam HI and Sad. Among the fragments obtained 
50 with the two digestions, those having dimensions nonmally of between 3,000 and 4,500 bp are purified. The 
selection is carried out by estimating the molecular weight of the D-hydantoinase and D-N-carit>amylase 
enzymes of 50,000 and 34,000 Daltons respectively. 

The two populations of BamHI and Sad fragments are then ligated to a vector of E.cort under such 
conditions as to facilitate ttie condensation of a single fragment to each molecule of the vector. The two 
55 ligase mixtures are used to transform cells of E.coli made competent as shown for example by Dagert, M. 
and Ehrirch (1979). (Gene, 6:23). 

The two populations of colonies (genome t>anks) thus obtained, each of which canrying a hybrid plasmid 
i.e. consisting of the molecule of the vector and a chromosomal DNA fragment of A.radiobacter , are then 
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selected to identify those clones containing the hydantoinase and cart)amylase genes. 

The identification can be canied out by direct expression or using specific probes. The second method 
Is preferably used. For the selection of the probes, in the case of cart>amylase, reference was made to the 
knowledge of the amino^nd sequence of carbamylase by Comomonas sp. 5222c (Ogawa et al. (1993). Eur. 
5 J. Biochem.. 212: 685-691 ). 

On the basis of this sequence small oligonucleotides are synthesized which, once marked, are used for 
the screening of the genothecas by hybridization techniques (Maniatis et al., (1982), "Molecular Cloning: a 
laboratory manual". Cold Spring Hart>or Latx)ratory). 

This permitted the identification of a clone carrying a hybrid plasmid carrying a BamHI fragment 
10 containing the nucleotidic sequence which encodes for the whole carbamylase. Analysis of said plasmid 
showed, in addition, the presence of a second incomplete ORF, placed on the other strand with respect to 
the cart>amylase gene, which showed a homok)gy with urease portions isolated from vanous microorgan- 
isms. 

As ureases, like hydantoiases. are enzymes belonging to the group of amido-hydrolases, it was 
75 assumed that the incomplete ORF corresponded to that of hydantoise. This assumption was then confirmed 
by the enzymatic activity tests carried out on cellular extracts of cells carrying the identified gene. 

In order to isolate the whole nucleotide sequence encoding hydantoinase, a screening of the gene 
library of the DNA of A.radiobacter digested with SacI was carried out by hybridizatran with an 
oligonucleotide synthesized on the t>asis of the nucleotide sequence of the incomplete ORF. 
20 The screening led to the isolatk>n of a clone containing the whole hydantoinase gene. The genes thus 
isolated were sequenced using the sequenase version Kit 2.0 sold by United State Biochemical. 

For the construction of a plasmid comprising both of the isolated genes vectors selected from plasmids, 
cosmids and bacteriophages known in tfie art. can be used. 

The bifunctional plasmid of E.coli and B.subtilis , pSM671 CBS 205.94 is preferably used. 
25 This plasmkS. which comprises the genes which encode for resistance to kanamydn and chloram- 
phenicol and has replication origins operable in E.coli and B.subtilis , is characterized in that it contains a 
synthetic promoter capable to direct with high efficiency and without inducers, the expression of the genes 
put under its control. 

In practice, the DNA fragments containing the genes which encode the D-hydantoinase and D-N- 
30 cart)amylase enzymes are cbned into the plasmid pSM671 In the unique restriction sites EcoRI and Hindlll 

obtaining the recombinant plasmid pSM651 . 

The construction can be carried out operating according to the general techniques known in the field of 

recombinant DNA. In order to verify whether these enzymes are expressed from B.subtilis and E.coli , cells 

transformed with said plasmid are cultured in a suitable culture medium. The total proteins, extracted from 
35 the cellular lysate, analyzed on polyacrylamide gel showed the presence of two proteins having a molecular 

weight con-esponding to that of the two enzymes; these proteins represent atx)ut 10% of the total proteins. 

These results confirm the capacity of B.subtilis and E.coli to express said enzymes with high efficiency and 

witfK>ut inducers. 

The enzymatic system of the present invention can t>e obtained by culturing tfie strains E.coli or 
40 B.subtilis transformed with the plasmid pSM651. under aerobic conditk>ns, in an aqueous medium contain- 
ing assimilable sources of carbon and nitrogen as well as various cations, anions and, possibly, traces of 
vitamins, such as biotin. thiamine, or amino acids. 

Assimilable carit)on sources comprise carbohydrates such as glucose, hydrolized amides, molasses, 
sucrose or other conventional cartx)n sources. 
45 Examples of nitrogen sources can be selected, for example, from mineral ammonium salts, such as 
ammonium nitrate, ammonium sulphate, ammonium chloride or ammonium carkK)nate and urea or materials 
containing organic or Inorganic nitrogen such as peptone, yeast extract or meat extract. 

The folk>wing cations and anions are equally suitable for the object of the present invention: potassium, 
sodium, magnesium, iron, calcium, acid phosphates, sulphates, chlorides, manganese, and nitrates. 
50 The fermentation is carried out, under stirring, at a temperature of between 25* and 40 'C. preferably 
between 30 • and 37 * C and at a pH of between 6 and 7.5, preferably tjetween 6.5 and 7.0. 

The cells (tuomass) recovered from the culture medium by means of the conventional techniques such 
as centrifugation or filtration are used in the conversion phase of the racemic mixtures of 5-substituted 
hydantoins. 

55 Altematively, the conversion reaction can be carried out using either the cellular extract obtained from 
the disintegration of the cells by sonication or French-Press, or enzymes purified or partially purified with 
the conventional methods, or enzymes immobilized on insoluble supports. 
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Numerous hydantoins substituted in position 5 can be used in the process of the present invention. 
Possible substituents in position 5 are selected from a linear or branched alky! group with a number of 
carbon atoms of between 1 and 6, which can t>e mono or polysubstituted with hydroxy, carboxy, 
hydrosulphide or amino groups or a phenyl or benzyl group which, in turn, can contain one or more 
5 substituents In ortho, meta and para position. Examples of 5-substituted hydantoins are: D,L-5-phenylhydan- 
toin, D.L-5-para-hydroxyphenylhydantoin, D,L-5-mettiylhydantoin. D.L-J5-isopropylhydantoin. D.L-5-ttiienyl- 
hydantoln. D.L-5-para-methoxyphenylhydantoin, D,L-5-para-chloro phenylhydantoin, D,L-5-benzylhydantoin. 

The conversion of the hydantoins into the corresponding [>a-amino acids is canied out in a nitrogen 
atmosphere in a hermetically closed apparatus, at a temperature of between 20 and 60 •C. preferably 
70 between 30 and 45 * C. 

The pH of the reaction medium is maintained wltiiin values of between 6 and 10 and preferably 
between 7 and 8.5. This regulation of the pH can be carried out, for example, by adding a base aqueous 
solution such as an aqueous solution of ammonia, potassium hydroxide, sodium hydroxide, sodium or 
potassium cart)onate. 

75 The initial concentration of the hydantoins is generally between 2% and 30% by weight As a result of 
the stereospeclficity of the enzymes produced from the strains of tiie present invention, only the D- 
enantiomorphs of the hydantoins are hydrolized. As hydantoins however, spontaneously racemize more or 
less rapidly under the operating conditions, the L-enantiomorphs are completely converted into the 
corresponding D-a-amino acids. 

20 The quantity of biomass which is added to the reaction mixture depends on the particular affinity of the 
substrate towards the enzymes. Generally a ratio by weight biomass/hydantoins of between 1/1 and 1/50 
can be used. 

When the conversion reaction is carried out under optimum conditions a yield of 95-98% is obtained. 
The D-a-amino acids prepared with the process of the present Invention can be recovered from the 
25 reaction medium with the conventional n>ethods such as ion-exchange chromatography or precipitation of 
the amino acid at its isoelectric point. 

The plasmid pSM651 was deposited at the Bureau Voor Schimmelcultures, SK Baam (Holland) as E.coli 
SMC305 where it received the deposit number CBS 203.94. 

The following experimental examples provide a better illustration of the present invention but do not 
30 limit it in any way. 

Example 1 

Extraction of the chromosomal DNA from A.radiobacter 

35 

100 ml of fermentation medium having the following composition: 
1% glucose. 0.3% yeast extract. 1.36% KH2P0*. 0.02% MgS04.7H20 (pH 7.0) were inoculated witti ttie 
strain A.radiobacter (NRRLB 1 1291) and maintained under stirring (220 rpm) at 30 • C for 24 hours. 

The cells were then recovered by centrifugation of the culture broth in an SS34 rotor model Sorvall RC- 
40 5B (at 4* C and 5000 rpm for 10 minutes) and then washed (2x120 ml) with a solution (TE) containing 1 mM 
EDTA. 10 mM Tris-HCI. pH 7.4. The resulting suspension was centrifuged again as above and the cells 
were recovered and resuspended in 9.5 ml of TE solution. After adding 0.5 ml of 10% SDS (sodium 
dodecylsulphate) and 50 ul of a solution of Proteinase K (20 mg/ml). the suspension was incubated at 37*C 
for 1 hour. 

45 1.8 ml of NaCI 5 M and 1.5 ml of a solution consisting of 10% hexadecyltrimethyl ammonium bromide 
(CTAB) in 0.7 M NaCI were subsequentiy added and the resulting solution was incubated at 65* C for 20 
minutes. The solution was then deproteinized with an equal volume of chlorofonn/isoamyl alcohol (24/1 , v/v) 
and tiie DNA was precipitated witt) 0.6 volumes of isopropanol. The DNA was washed witti 1 ml of etiianol 
(70%) and recovered with a glass rod. The recovered DNA was finally dissolved in 4 ml of TE and its 

50 concentration was determined by spectrophotometry at 260 nm. 

The chromosomal DNA was purified again by centrifugation on a gradient of CsCI (1%) containing 0.1 
mg/ml of ethidium bromide (55,000 rpm for 16 hours in a Beckman rotor V65Ti). 

Ihe DNA band was visualized under a UV light and the etiiidium bromide was removed by exti^on 
with butanol saturated in HzO. After dialysis against a TE buffer, the DNA was precipitated with ethanol and 

55 resuspended in the desired volume. 
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Example 2 

Construction of a genomic bank of A.radiobacter 

5 Aliquots (10 tig) of tfie DNA ttius obtained were digested, separately, with 25 units of each of the 
restriction enzymes EcoRI, PstI, BamHI. Sad. and SphI (Boehringer) operating according to the instructions 
of the producer. 

After blocking the enzymatic reactions at 65" C for 10 minutes, the reaction mixtures were charged onto 
agar gel at 0.8% and mn at 100 volts for 2 hours. The DNA bands, visualized by coloring with EtBr (0.5 
10 gamma/ml), were then transfenred onto a nybn filter (Boehringer) and after lysis with NaOH, the DNA was 
immobilized according to the Southem blot technique (Maniatis at al., "Molecular Cloning: a practical 
laboratory manual". Cold Spring Hart)or, New York, 1982). 

The filter was hybridized at 45-C with each of the degenerated oligonucleotides, conceived on the 
basis of the amino-end of the carbamylase of Comamonas sp. E222c (Ogawa et al.. (1993). Eur. J. 
15 Biochem., 212: 685-691), having the sequence: 

1) 5»CGA ATT GTA AAT TAT GCA GCA GC 3' 
20 AGCGCCGG 

C A C C C 

T T T T 

25 

2) 5*GGA CCA ATT CAA C6A GC 3* 

G G C G G 

30 

CCA C 
T T T 
^ 3) 5'CGA GCA GAT GTA ATG GA 3' 

A G G C G 



T T T 



45 These oligonucleotides were synthesized using the automatic System OLIGO 1000 system of Beck- 
mann and then marked at the 5'end using the kit DIG SYSTEM (Boehringer). The hydridation reaction with 
probe 2 gave positive signals. In particular, the DNA digested with BamHI generated a fragment of atxxjt 
4000 bp capable of hybridizing the probes. 

To Isolate the BamHI fragment thus Identified, 10 ug of chromosomal DNA were suspended in 50 ii\ of 

50 buffer 10 mM Tris-HCI pH 8, 100 mM NaCI. 5 mM MgCb. 1 mM 2-mercaptoethanol and incubated at 37'C 
for 4-5 hours In the presence of 25 U of the enzyme BamHI. 

The digestion mixture was then subjected to electrophoresis on agar gel at 0.8% and. after colouring 
with EtBr. DNA fragments of 3,500 - 4,500 bp were electroeluated in the electrophoresis buffer (Maniatis et 
al. "Molecular Cloning: a practical laboratory manual". Cold Spring Habor, New Yoric 1982). 

55 The chromosomal DNA fragments in the plasmid pUCIB (BRL) were then cloned. In practice, 20 ng of 
this plasmid. previously linearized with the restriction enzyme BamHI. were llgated with 1(X) ng of the 
chromosomal DNA fragments in 20 u! of mixture containing 66 mM Tris-HCI pH 7.6, 1 mM ATP. 10 mM 
MgCb. 10 mM Dithiotreitol (DTT). In the presence of 1 U of T4 DNA Itgase, at 16* C for a night. 
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The ligase mixture was used for transforming cells of E.coli JM101 (BRL) made competent with 50 mM 
CaCa (Dagerl. M. and Ehrlich (1979). Gene. 6:23). 

"me transformants were subsequently selected on plates of LB medium (8 g/l Bactotryptone (DIFCO), 5 
g/l NaCI, 15 g/l Agar (DIFCO), 0.5 g/l yeast extract) to which 40 ug/ml of X-Gal (5-bromo-4-chloro-3-indolyl- 
D-thio-galactopyranoside) and 100 ug/ml of ampicillin had been added. 

Operating as described above numerous positive recombinant colonies (white) were obtained which 
were easily distinguishable from those not recomk>inant (blue). 

The positive clones were transferred onto nylon filters (Boehringer) and the DNA extracted from these 
clones was hybridized under the same conditions using probe 2 which had responded positively to 
hybridation with the chromosomal DNA. 

The plasmids extracted from the clones which gave a positive signal were sequenced using the 
Sequenase version 2.0 Kit (United States Biochemical). One of these plasmids. containing the complete 
carbamylase gene (915 bp) was called pSM652. 

Figure 4 shows the nucleottdic and amino acidic sequence of cart)amylase. 

Example 3 

Isolation of the hydantoinase gene of A.radiobacter 

Analysis of the plasmid pSM652 showed the presence of a second incomplete ORF, situated on the 
other strand with respect to the carbamylase gene, which showed a homology with urease portions isolated 
from various microorganisms. 

As ureases, like hydantoinases. are enzymes belonging to the group of amidohydrolases. it was 
assumed that the incomplete ORF conresponded to that of hydantoinase. The assumption was then 
confirmed by enzyn)atic activity tests carried out on cellular extracts of cells carrying the identified gene. 

In order to isolate the whole nucleotidic sequence encoding the hydantoinase. the same Souttiem Blot 
used for isolating the carbamylase was hybridized using as probe the oligonucleotide having the sequence: 
5' ATC GTA ACC GCG (SAG GGG ATT TCT CCC 3* 

This oligonucleotide, homologous to the 5*end region of the nucleotidic sequence of identified partial 
ORF. was synthesized and mari^ed as shown in example 2. Among the positive bands for this probe a band 
of about 3500 bp obtained from the digestion of the DNA with the enzyme Sad, was identified. 

Operating as shown in example 2 a genomic bank of chromosomal DNA of A.radiobacter digested with 
Sad was then constructed. Screening of this bank led to the isolation of the plasmid pSM653 containing the 
whole gene for hydantoinase whose nucleotide and amino acid sequence is shown in figure 5. 

Example 4 

Cloning of the carbamylase gene 

1) Amplification of the cart>amylase gene 

The plasmid pSM652 was amplified by the Polymerase Chain Reaction (PCR) technique, according to 
the Indlcaltons specified by Leung et al. (Leung D.W., Chen E.. Goeddel D.V.. Technique - a journal of 
methods in cell and mdecular biobgy. 1, No. 1 (1989): pages 11-15), using the pair of oligonucleotides: 

(1) 5* GG G AAT TCT TAT GAC ACQ TCA G 3« (FORWARD) 

EcoRI 

(2) 5* CCC AAG CTT CAA AAT TCC GCG AT 3» (REVERSE) 

Hindlll 

The oligonucleotide (2) also allowed the restriction site EcoRI present inside the carbamylase gene near 
3'end. to be eliminated. 
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The amplification was carried out in a DNA Thermal Cycler 480 apparatus (Perkin - Elmer Cetus) using 
a reaction mixture (100 ul) containing 10 mM Tris HCI pH 8.3. 1.5 mM MgCb, 50 mM KCl, 0.01% 
(weight/volume) of gelatine. 1 ng of pSM652. 1 uM of the two primers. 200 uM of dNTP, 0.5 Units of Taq 
polymerase (Perkin Elmer). 

5 A drop of mineral oil is added and the mixture Is denatured for 4 minutes at 94-C and the cyclic 
program is started, which comprises: 
1 minute at 94*C (denaturation) 

1 minute at 55 * C (annealing) 

2 minutes at 72 • C (elongation) 

10 for a total of 30 cycles, followed by 8 minutes at 72* C (final extension). 

The amplification product thus obtained was treated with phenol-chloroform (1:1). precipitated with NaCI 
(1/10 vol/vol) and EtOH (2 vol) and resuspended in 20 ul of hfeO. After cutting with the restriction enzymes 
EcoRI and Hindlll (5 U) suitable for cloning into the plasmid pSM671 (CBS 205.94) the DNA fragments were 
purified on low-melting gel (SeaPlaque. FMC BioProducts) at 1 .0% and the bands eluated by the gel were 
15 treated with GEIase (Epicentre Technologies) (1 U every 300 ug of gel weighed) for 1.5 hours at 45*C. 
At the same time, 50 ng of the plasmid pSM671 were cut with the same restriction enzymes. 
The plasmid and fragments were ligated in 10 ul of reaction mixture (DNA 20 ng/ml) and 2 ul of this 
mixture were used for transforming cells of E.coli 71/18 made competent with CaCfe (Dagert and Ehriich, 
Gene. 6: 23, 1979). The transformants were selected on plates of LB medium containing 20 ug/ml of 
20 chk)ramphenkx)l. 

The plasmid DNA extracted from the positive ctones was analized to verify exact insertion into the 
carbamylase gene and the al)sence of possible errors caused by the amplification. 
One of these plasmids was called pSM637. 

The strain of E.coli containing the plasmkl pSM637 was called SMC307. 
25 Cells of B subtilis SMS108 NRRLB-1 5.898 made competent as described in "Molecular Biology 
Methods for Bacillus", (1990) (Harwood and Cutting (eds) Wiley and Sons) were transformed with 100 ng of 
the plasmid pSM637 operating according to the known techniques, and the transformed strain was called 
SMS374. 

30 Example 5 

Expression of the carbamylase gene in E.coli and B.subtilis 

The object of the experiment was to verify the ability of the transformed strains (E.coli SMC307 and 

35 B.subtilis SMS 374) to express the carbamylase gene without inducers. 

A preculture on slant of the strain E.coli SMS307 and B.subtilis SMS 374 was inoculated into two 100 
ml flasks containing, respectively, 10 ml of LB medium to which 20 ug/ml of chloramphenicol had been 
added and 10 ml of VY medium to which 5 ug/ml of chloramphenicol had been added. The flasks were 
incubated, under stining. (220 rpm), at 37 • C for 16 hours. 

40 The cells were recovered by centrifugation (12,000 rpm. 4 'C, for 1 minute) of the two culture broths, 
resuspended in 300 u I of buffer 20 mM Tris-HCI pH 7.5, 20 mM BMeOH, 20% glycerol and lysed by 
sonication (Soniprep150, MSE 1 minute Impulses, at average voltage). Allquots (15 ul) of the two lysates 
were charged onto polyacryalamide gel at 10% and run at 20 mA for three hours. The proteic bands were 
visualized by colouring with Coomassie R-250 (Laemmli, Nature: 227, 680. 1970). After cokxjring with 

45 Coomassie a proteic tjand was revealed with a molecular weight of 34.0(X) D absent In the extracts of 
untransformed strains B.subtilis SMS108 and E.coli 71/18. In addition, densitometric analysis carried out on 
the same gel coloured with Coomassie showed that this protein was expressed in both of the transformed 
strains as one of the prevalent proteins (10% with respect to the total proteins). 

50 Example 6 

Cloning of the hydantoinase gene 

The plasmid pSM653 (1 ug) was digested with the restriction enzymes EcoRV and Sail (4 U) 
66 (Boehringer) at 37 • C for 1 hour. 

The digestion mixture was then subjected to electrophoresis on agar gel at 0.8% (low melting) and, 
after colouring with EtBr, the DNA band conresponding to an EcoRV-Sall fragment of 1300 bp was recut and 
the DNA extracted with the Gelase TM method (EPICENTRE Technotogies). As this fragment has a small 
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region missing at the 5'end and a portion of 70 bp at 3' end. the whole hydantoinase gene was 
reconstructed using two linkers having the sequence: 



LINKER 5' 

5' AATTCT TATG GAT 3* 

EcoRI 
LINKER 2* 

5> TCGA CGAGGG AACCTACGTG GGGGCGCCGA CGGATGGCCA 
Sail 

GTTCCGGAAG CGCCGCAAAT ACAAGCAATA AGGAGg 3« 

EcoRI 

40 ng of the 1300 bp fragment. 40 ng of the linlcer 3', 10 ng of the linker 5' and 50 ng of the plasmid 
pSM671 CBS 205.94, previously linearized with EcoRI, were then iigated in a ligase mixture containing 1 U 
of T4 DNA ligase, incubating at 12* C for 16 hours. The ligase mixture was subsequently used to transform 
competent cells of E.coli 71/18 and the transformants were selected on plates of LB medium to which 20 
ug/ml of chloramphenicol had been added. 

Ihe plasmid DNAs isolated from some of the positive clones were analyzed to Identify the clones 
containing the complete sequence of the hydantoinase gene. 

One of these plasmids was called pSM650 and the strain of E.coli containing said plasmid was marked 
with the abbreviation SMC308. 

100 ng of the plasmid pSM650 were used to transform competent cells of B.subtilis SMS108. The resulting 
strain was called SMS375. 

Example 7 

Expression of the hydantoinase gene in E.coli and B.subtilis 

The object of the experiment was to verify the capacity of the transformed strains (E.coli SMS308 and 
B.subtilis SMS375) to express the hydantoinase gene without inducers. 

Apreculture on slant of the strain E.coli SMS308 and B.subtilis SMS375 was inoculated into two 50 ml 
flasks containing, respectively. 10 ml of LB medium to which 5 ug/ml of chloramphenicol had been added 
and 10 ml of VY medium to which 20 ug/ml of chtoramphenteol had been added. The flasks were 
incubated, under gentle stirring, (220 rpm), at 37'C for 16 hours. 

The cells were recovered by centrifugation (12,000 rpm. 4*C, for 1 minute) of the two culture broths, 
resuspended in 300 ul of buffer 20 mM Tris-HCI pH 7.5. 20 mM BMeOH. 20% glycerol and lysed by 
sonication (1 minute impulses, at average voltage). Aliquots (15 ul) of the two lysates were charged onto 
polyacryalamide gel at 10% and run at 20 mA for three hours. The proteic bands Y/ere visualized by 
colouring with Coomassie R-250 (Uemmli. Nature: 227. 680. 1970). After colouring with (Joomassie a 
proteic band was revealed with a molecular weight of 50.000 Daltons absent in the extracts of untransfor- 
med strains B.subtilis SMS108 and E.coll 71/18. In addition, densitometric analysis canied out on the same 
gel coloured with Coomassie showed that this protein was expressed in the two transformed strains as one 
of the prevalent proteins (10% with respect to the total proteins). 
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Example 8 

Cloning of the hydantoinase-carbamylase operon 

5 The plasmid pSM650 (1 ug) was digested with the enzyme EcoRI (5 U) at 37*C for 1 hour. The EcoRI- 
EcoRI fragment of atx)ut 1380 bp containing the hydantoinase gene was purified by agar gel at 0.8% with 
the Gelase TM method. 20 ng of this fragment were ligated with 50 ng of the plasmid pSM637 linearized 
with EcoRL The reaction was carried out in a ligase buffer containing 1 U of T4 DNA ligase, at 16* C for 16 
hours. 

10 The ligase mixture was used to transform competent cells of E.coli 71/18. 

The transformants were subsequently selected on plastes of LB medium (8 g/1 Bactotryptone (DIFCO), 
5 g/l NaCI, 15 g/l Agar (DIFCO). 0.5 g/I yeast extract) to which 20 ug/ml of Chloramphenicol had been 
added. 

The positive clones were analyzed by restriction analysis to verify the correct insertion into the two 
IS genes. The plasmid containing the hydantoinase-carbamylase operon was called pSM651 and the strain of 
E.coli containing said plasmid was marked with the abbreviation SMC305. 

Competent cells of B.subtilis SMS108 were transfonned with 100 ng of this plasmid. One of the positive 
clones was called SMS373. 

20 Example 9 

Expression of the hydantoinase-carbamylase operon 

E.COII SMS305 and B.subtilis SMS373 were cultured, respectively, in 100 ml of LB medium to which 20 
25 ug oFchioramphenicol had been added and in 100 ml of VY medium to which 5 ug of chloramphenicol had 
been added, at 37 'C for 16 hours, under stinring (200 rpm). The proteic extracts obtained from the cellular 
lysates were analyzed as described in example 7. The results showed the presence of two proteins 
corresponding to hydantoinase and carbamylase (figure 6). To evaluate the activity of these enzymes, a 
reaction kinetics was carried out using 20 mM (D.L) parahydroxyphenyl-hydantoin as substrate or alter- 
30 natively 5-phenyl-hydantoin (in 200 mM of phosphate buffer pH 8) and following the conversion into the 
corresponding D-a-amino acid with the evolution of ammonia. The process adopted is described by 
Weatherburn, M.W.. (1967), (Anal. Chem.. 39:971). 

Example 10 

35 

Conversion of D,L-5-phenylhydantoln to D-phenylglycine 

A suspenskxi of 2 g of D,L-5-phenyl-hydantoin in 100 ml of Na-phosphate 0.2 M buffer pH 8.0 was 
charged into an apparatus equipped with a stinrer and thermostat-regulated at 40*C. After degassing with 
40 nitrogen at 40* C for 5 minutes, 5 g (humid weight) of biomass was introduced, coming from a culture of 
Ecoli SMS305, carried out as described in example 9. 

After the apparatus had been henfnetically closed, the reaction mixture was maintained under a nitrogen 
atmosphere, at 40*0 for 24 hours. Polarimetric and thin layer chromatographic analysis (J. of Chromatog- 
raphy. 80: 199-204). 1973) of an aliquot of the reaction mixture showed the complete hydrolysis of the 
45 starting~substrate to D-phenylglyclne. 

After separation of the biomass by centrifugatk>n of the reaction mixture at 6000 rpm for 10 minutes, the 
sumatant was acWified to pH 1.0 with HCI 6 M and charged onto a column (2.6 x 20 cm) of Amberiite IR 
120 (activated with HCI). The column was then washed with water and eluted with an ammonia solution at 
5% in water. The eluate was decoloured with decolouring cartx)n (C.Erba), and the decoloured solution was 
50 concentrated under vacuum and brought to pH 5.8. The crystals thus obtained were recovered by filtratton 
and recrystallized from water. The white powder obtained (1.63 g) showed a specific rotation [a]D20 = 
-156' (c = 1. 1 N HCI). The IR spectrum was in agreement with that of the standard D-phenylglycine. 
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Example 11 

Conversion of D.L-5-phenylhydandoin to P-phenylglycine. 

5 The same procedure was canled out as in example 10. using 5 g (humid weight) of biomass coming 
from the culture of E.colt SMS305 and 10 g of D.L-5-phenylhydantoin in 100 ml of Na-phosphate 0.2 M 
buffer pH 8.0, The reaction was carried out under a nitrogen atmosphere, at 40* C for 90 hours. The white 
powder obtained (8.1 g) showed a specific rotation [ai^ = -156.5* (c = 1. 1 N HCI). Tlie IR spectrum 
agreed with that of the standard D-phenylglycine. 

10 

Example 12 

Conversion of D.L-5-para-hydroxy-phenylhydantoin to D-para-hydroxy-phenylglyctne 

T5 The same procedure was canried out as in example 10, using 2.5 g (humid weight) of biomass and 1 g 
of D.L-5-para-hydroxy-phenylhydantoin The D-para-hydroxy-phenylglycine obtained as a white powder (0.82 
g) showed a specific rotation [a]i>20 = -158* (c = 1.1 N HCI). The IR spectrum was in agreement with that 
of the standard D-phenylglycine. 

20 Example 13 

Conversion of D.L-5-para-hydroxy-phenylhydantoin to D-para-hydroxy-phenylgfycine 

The same procedure was carried out as in example 10. using 2.5 g of biomass (humid weight) obtained 
25 from the culture of E.coli SMS305 and 8 g of D,L-5-parahydroxy-phenylhydan1oin. 

The reaction was carried out under a nitrogen atmosphere, at 40*0 for 170 hours. The D-parahydroxy- 
phenylglycine obtained as a white powder (6.6 g) showed a specific rotation [a]D20 = -157.8* (c = 1,1 N 
HCI). The IR spectrum was in agreement with that of the standard D-phenylglycine. 

30 Example 14 

(k)nversion of D.L-5-isopropylhydantoin to D-valine 

The same procedure was carried out as in example 10, using 5.0 g of biomass (humid weight) obtained 
35 from the culture of E.coli SMS305 and 2 g of D.L-5-isopropylhydantoln. 

The reaction was carried out under a nitrogen atmosphere, at 40*C for 240 hours. The D-valine 
obtained as a white powder (0.8 g) showed a specific rotation [ab20 = -27.5* (c = 5. 6 N HCI). The IR 
spectrum agreed with that of the standard D-valine. 
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SEQUENCE LISTING 

NUMBER OF SEQUENCES: 19 

(1) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 23 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(li) MOLECULE TYPE: DNA (Genomic) 
(Xi) SEQUENCE DESCRIPTION: 

CGAATTGTAA ATTATGCAGC AGC 23 
(1) INFORMATION FOR SEQ ID N0x2: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 23 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
AGGATCGTGA ACTACGCGGC GGC 23 
(1) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 23 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA(Genomic) 
(xi) SEQUENCE DESCRIPTION: 
CGCATAGTCA ATTATGCCGC CGC 2 
(1) INFORMATION FOR SEQ ID NO: 4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 23 base pairs 

(B) TYPE; Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DIIA(Genonic) 
(xi) SEQUENCE DESCRIPTION: 
CGTATTGTTA ATTATGCTGC TGC 

(1) INFORMATION FOR SEQ ID NO: 5: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(li) MOLECULE TYPE: DNA(Genomlc) 
(xi) SEQUENCE DESCRIPTION: 
GGACCAATTC AACGAGC 
(1) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(0) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA(GenOBiC) 
(xi) SEQUENCE DESCRIPTICW : 
GGGCCGATCC AGCGGGC 

(1) INFORMATION FOR SEQ ID N0:7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 



14 



EP 0 677 585 A1 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA( Genomic) 
(Xi) SEQUENCE DESCRIPTIOH: 
GGCCCCATAC AACGCGC 

(I) IMFORMATIOH FOR SEQ ID N0:8; 
(i) SEQUENCE CHARACTERISTICS: 

(A) LEHCarr: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(II) MOLECULE TYPE: DNA(GenOttlc) 
(Xl) SEQUENCE DESCRIPTION: 
OGTCCTATTC AACGTGC 

(1) INFORMATION FOR SEQ ID M0:9: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(II) MOLECULE TYPE: DNA(Genomic) 
(xl) SEQUENCE KSCRIPTION: 
CGAGCAGATG TAATGGA 

(1) INFORMATICMI FOR SEQ ID NO: 10 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(II) MOLECULE TYPE: DNA(Genoilic) 
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(Xi) SEQUENCE DESCRIPTION; 
AGGGCGGACG TGATGGA 

(I) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDCDNESS: Sin9le 

(D) TOPOLOGY: Linear 

(II) MOLECULE TYPE: DNA(Genomic) 
(xi) SEQUENCE DESCRIPTION: 
CGCGCC6ATG TCATGGA 

(1) INFORMATI(»l FOR SEQ ID NO:12 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA( Genomic) 
(xi) SEQUENCE DESCRIPTION: 
CGTGCTGATG TTATGGA 

(1) INFORMATION FOR SEQ ID NO; 13 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENQfT: 27 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA(Genonic) 
(xi) SEQUENCE DESCRIPTION: 
ATCGTAACCG CGGACGGGAT TTCTCCC 
(1) INFORMATION FOR SEQ ID NO: 14 
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(i) SEQUENCE CHIUIACTERISTICS: 

(A) LENGHT: 22 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(ix) FEATURE: 

(A) NAME: Priner 

(xi) SEQUENCE DESCRIPTION: 

GGGAATTCTT ATGACACGTC AG 

(1) INFORMATION FOR SEQ ID MO: 15 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 23 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(ix) FEATURE: 

(A) NAME: Primer 
(xi) SEQUENCE DESCRIPTION: 
CCCAAGCTTC AAAATTCCGC GAT 23 
(1) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 13 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA( Genomic) 
(ix) FEATURE: 

(A) NAME: Linker 
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(xi) SEQUENCE DESCRIPTION: 

AXTTCTTATG GAT 13 

(I) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 76 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(II) MOLECULE TYPE: DNA{ Genomic) 
(Ix) FEATURE: 

(A) NAME: Linker 

(xl) SEQUENCE DESCRIPTION: 

TCGACGAGGG AACCTACGTG GGGGCGCCGA CGGATGGCCA 40 
GTTCCGGAAG CGCCGCAAAT ACAAGCAATA AGGAGG 76 

(I) INFORMATICW FOR SEQ ID NO: 18: 
<1) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 915 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(II) MOLECULE TYPE: DNA (Genomic) 
(vl) ORIGINAL SOURCE: 

(A) ORGANISM: ^^irobacterium radiobactex 
(xl) SEQUENCE DESCRIPTION: 

ATG ACA CGT CAG ATG ATA CTT GCT GTC GGA CAG CAA GGC CCC ATC 
Met Thr Arg Gin Met He Leu Ala Val 61y Gin Gin Gly Pro He 

5 10 15 

GCG CGA GCG GAG ACA CGC GAA CAG GTG GTT GGC CGC CTC CTC GAC 
Ala Arg Ala Glu Thr Arg Glu Gin Val Val Gly Arg Leu Leu Asp 
20 25 30 
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ATG TTG 
Net Leu 

CCC GAG 
Pro Glu 

GAG GAG 
Asp Glu 

COG GTG 
Pro Val 

GGC TTC 
Gly Phe 

AAG CGT 
Lys Arg 

ATC GTC 
lie Val 



ACG AAC 
Thr Asn 

CTT GCG 
Leu Ala 

GCC GAG 
Ala Glu 

GTC CGT 
Val Arg 

AAT CTG 
Asn Leu 

CGC TTC 
Arg Phe 

GGC AAG 
Gly Lys 



TAG GAG 
Tyr Glu 

GAG CCG 
Glu Pro 

AAA ATG 
Lys Met 



GCC TAC 
Ala Tyr 

GGC GAT 
Gly Asp 

GGG ATG 
Gly Net 



GCA GCC 
Ala Ala 

35 

CTC ACG 
Leu Thr 
50 

CTC GAT 
Leu Asp 
65 

CCA CTC 
Pro Leu 
80 

GGC TAC 
Gly Tyr 
95 

AAC ACG 
Asn Thr 
110 

TAT CGT 
Tyr Arg 
125 

CGG CCG 
Arg Pro 
140 

CTC GGC 
Leu Gly 
155 

TTC ATC 
Phe He 
170 



AGC CGG GGC GTG AAC TTC ATC GTC TTT 135 
Ser Arg Gly Val Asn Phe He Val Phe 

40 45 
ACC TTC TTC CCG CGC TGG CAT TTC ACC 180 
Thr Phe Phe Pro Arg Trp His Phe Thr 

55 60 
AGC TTC TAT GAG ACC GAA ATG CCC GGC 225 
Ser Phe Tyr Glu Thr Glu Met Pro Gly 

70 75 
TTV GAG ACG GCC GCC GAA CTC GGG ATC 270 
Phe Glu Thr Ala Ala Glu Leu Gly He 

85 90 
GCC GAA CTC GTC GTC GAA GGC GGC GTC 315 
Ala Glu Leu Val Val Glu Gly Gly Val 
100 105 
TCC ATT CTG GTG GAT AAG TCA GGC AAG 360 
Ser He Leu Val Asp Lys Ser Gly Lys 
115 120 
AAG ATC CAT TTG CCG GGT CAC AAG GAG 415 
Lys He His Leu Pro Gly Hys Lys Glu 
130 135 
TTC CAG CAT CTT GAA AAG CGT TAT TTC 450 
Phe Gin His Leu Glu Lys Arg Tyr Phe 
145 150 
TTC CCG GTC TAT GAC GTC GAC GCC GCG 495 
Phe Pro Val Tyr Asp Val Asp Ala Ala 
160 165 
TGC AAC GAT CGC CGC TGG CCT GAA ACG 540 
Cys Asn Asp Arg Arg Trp Pro Glu Thr 
175 180 



19 



EP 0 677 585 A1 



70 



15 



20 



25 



30 



35 



40 



TGG CGG GTG ATG QGA CTT AA6 GGC GCC GAG ATC ATC TGC GGC GGC 585 
Trp Arg Val Met Gly Leu Lys Gly Ala Glu lie lie Cys Gly Gly 
185 190 195 

TAG AAC ACG CCG ACC CAC AAT CCC CCC GTT CCC CAG CAC GAG CAT 630 
Tyr Asn Thr Pro Thr His Asn Pro Pro Val Pro Gin His Asp His 
200 205 210 

CTG ACG TCC TTC CAC CAC CTT CTG TCG ATG CAG GCC GOG TCG TAC 675 
Leu Thr Ser Phe His His Leu Leu Ser Met Gin Ala Gly Ser Tyr 
215 220 225 

CAA AAC GGC GCC TGG TCC GCG GCG GCC GGC AAG GTC GGC ATG GAG 720 
Gin Asn Gly Ala Trp Ser Ala Ala Ala Gly Lys Val Gly Met Glu 
230 235 240 

GAG GGG TGC ATG CTG CTC GGC CAT TCG TGC ATC GTG GCG CCG ACC 765 
Glu Gly Cys Met Leu Leu Gly His Ser Cys He Val Ala Pro Thr 
245 250 255 

GGC GAA ATC GTT GCC CTG ACC ACG ACG TTG GAA GAC GAG GTG ATC 810 
Gly Glu He Val Ala Leu Thr Thr Thr Leu Glu Asp Glu Val He 
260 265 270 

ACC GCC GCC GTC GAT CTC GAC CGC TGC CGG GAA CTG CGC GAA CAC 855 
Thr Ala Ala Val Asp Leu Asp Arg Cys Arg Glu Leu Arg Glu His 
275 280 285 

ATC TTC AAT TTC AAA GCC CAT CGT CAG CCA CAG CAC TAC GGT CTG 900 
He Phe Asn Phe Lys Ala His Arg Gin Pro Gin His Tyr Gly Leu 
290 295 300 

ATC GCG GAA TTT TGA 
He Ala Glu Phe STOP 
^ (1) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGHT: 1373 base pairs 
^ (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Unear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(vi) ORIGINAL SOURCE 

(A) ORGANISM: Aarobacterium radiobacter 
(Xi) SEQUENCE DESCRIPTION: 

ATG GAT ATC ATC ATC AAG AAC GGA ACC ATC GTA ACC GCG GAC GG6 
Net Asp He He He Lys Asn Gly Thr He Val Thr Ala Asp Gly 

5 10 15 

ATT TCT CCC GCC GAT CTC GGA ATC AAG GAT GGC AAG ATC GCC CAG 
He Ser Pro Ala Asp Leu Gly He Lys Asp Gly Lys He Ala Gin 

20 25 30 

ATC GGC GGA ACG TTC GGC CCG GCC GGC CGG ACA ATC GAC GCC TCC 
He Gly Gly Thr Phe Gly Pro Ala Gly Arg Thr He Asp Ala Ser 

35 40 45 

GGC GGC TAG GTT TTT CCG GGC GGC ATC GAC GTT CAT ACG CAT GTC 
Gly Arg Tyr Val Phe Pro Gly Gly He Asp Val His Thr His Val 

50 55 60 

GAG ACG GTC AGC TTC AAC ACG CAG TCG GCC GAC ACA TTC GCA ACC 
Glu Thr Val Ser Phe Asn Thr Gin Ser Ala Asp Thr Phe Ala Thr 

65 70 75 

GCG ACG GTC GCG GCC GCC TGT GGC GGC ACG ACG ACC ATC GTC GAT 
Ala Thr Val Ala Ala Ala Cys Gly Gly Thr Thr Thr He Val Asp 

80 85 90 

TTC TGC CAG CAG GAC CGC GGC CAT AGC CTG AGG GAG GCG GTC GCC 
Phe Cys Gin Gin Asp Arg Gly Hys Ser Leu Arg Glu Ala Val Ala 
95 100 105 

AAA TGG GAC GGC ATG GCC GGC GGC AAG TCG GCG ATC GAC TAC GGC 
Lys Trp Asp Gly Met Ala Gly Gly Lys Ser Ala He Asp Tyr Gly 
110 115 120 
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10 



IS 



20 



25 



30 



36 



40 



45 



50 



TAC CAT ATC 
Tyr His lie 

CTA GAG GTA 
Leu Glu Val 

ATG OCT TAT 
Met Ala Tyr 

AGO ACG CTC 
Arg Ttir Leu 

CAC GCG GAG 
His Ala Glu 

GTC GCC GAT 
Val Ala Asp 

CCG CCC CGG 
Pro Pro Arg 

GCG GAA ATC 
Ala Glu lie 

GAA GAA TCC 
Glu Glu Ser 

CAC GCC CTG 
His Ala Leu 



ATC GTG 
He Val 
125 
CTG CCA 
Leu Pro 
140 
CGC OGC 
Arg Gly 
155 
GAC AAG 
Asp Lys 
170 
AAC GGC 
Asn Gly 
185 
GGC AAA 
Gly Lys 
200 
GTC GAA 
val Glu 
215 
GTC AAC 
Val Asn 
230 
TTC GAC 
Phe Asp 
245 
GCC GAA 
Ala Glu 
260 



CTC GAT 
Leu Asp 



GAT CTC 
Asp Leu 

ATG AAC 
Met Asn 

GCC GCC 
Ala Ala 

GAC GCC 
Asp Ala 

ACG GCG 
Thr Ala 

GCC GAG 
Ala Glu 

GCC CCG 
Ala Pro 

GAG TTG 
Glu Leu 

ACC TGC 
Thr Cys 



CCG ACT GAT AGC GTG ATC GAG GAG 405 
Pro Thr Asp Ser Val He Glu Glu 
130 135 
GGC ATC ACC TCC TTC AAG GTC TTC 450 
Gly He Thr Ser Phe Lys Val Phe 
145 150 
ATG ATC GAC GAC GTG ACA CTG CTC 495 
Met He Asp Asp Val Thr Leu Leu 
160 165 
AAG ACT GGG TCA CTC GTC ATG GTG 540 
Lys Thr Gly Ser Leu Val Met Val 
175 180 
GCC GAC TAT CTT CGC GAC AAG TTC 585 
Ala Asp Tyr Leu Arg Asp Lys Phe 
190 195 
CCG ATC TAC CAC GCG CTC AGC CGT 630 
Pro He Tyr His Ala Leu Ser Arg 
205 210 
GCG ACC GCG AGG GCC CTC GCC CTG 675 
Ala Thr Ala Arg Ala Leu Ala Leu 
220 225 
ATC TAC ATC GTG CAT CTG ACC TGC 720 
He Tyr He Val His Leu Thr Cys 
235 240 
ATG CGG GCA AAG GCT CGG GGT GTC 765 
Net Arg Ala Lys Ala Arg Gly Val 
250 255 
ACA CAA TAC CTC TAC CTC ACC AAG 810 
Thr Gin Tyr Leu Tyr Leu Thr Lys 
265 270 
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GAC GAC CTG GAG CGG CCG GAT TTC GAG GGC GCG AAG TAT GTT TTC 855 
Asp Asp Leu Glu Arg Pro Asp Phe Glu Gly Ala Lys Tyr Val Pha 

275 280 285 

ACC CCG CCT CCG CGC ACG AAG AAG GAC CAG GAA ATC CTC TGG AAC 900 
Thr Pro Pro Pro Arg Thr Lys Lys Asp Gin Glu He Leu Trp Asn 

290 295 300 

GCA CTC CGG AAC GGG GTC CTC GAA ACG GTC TCC TCG GAC CAT TGT 945 
Ala Leu Arg Asn Gly Val Leu Glu Thr Val Ser Ser Asp His Cys 

305 310 315 

TCC TGG CTC TTC GAG GGG CAC AAG GAT CGG GGC AGG AAC GAC TTC 990 
Ser Trp Leu Phe Glu Gly His Lys Asp Arg Gly Arg Asn Asp Phe 

320 325 330 

CGC GCC ATC CCG AAC GGA GCG CCG GGC GTC GAG GAG CGG CTG ATG 1035 
Arg Ala lie Pro Asn Gly Ala Pro Gly Val Glu Glu Arg Leu Met 

335 340 345 

ATG GTC TAT CAG GGC GTC AAC GAA GGC CGC ATT TCC CTC ACC CAG 1080 
Net Val Tyr Gin Gly Val Asn Glu Gly Arg lie Ser Leu Thr Gin 

350 355 360 

TTC GTA GAA CTG GTC GCC ACG CGC CCG GCC AAG GTC TTC GGC ATG 1125 
Phe Val Glu Leu Val Ala Thr Arg Pro Ala Lys Val Phe Gly Met 

365 370 375 

TTC CCG GAA AAA GGA ACG GTC GCG GTC OGT TCG GAT GCC GAC ATC 1170 
Phe Pro Glu Lys Gly Thr Val Ala Val Gly Ser Asp Ala Asp lie 

380 385 390 

GTC CTT TGG GAT CCC GAG GCT GAA ATG GTG ATC GAA CAA AGC GCC 1215 
val Leu Trp Asp Pro Glu Ala Glu Met Val He Glu Gin Ser Ala 

395 400 405 

ATG CAT AAC GCC ATG GAT TAC TCC TCC TAC GAG GGA CAC AAG ATC 1260 
Met His Asn Ala Met Asp Tyr Ser Ser Tyr Glu Gly His Lys He 

410 415 420 
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AAG GGC GTG CCG AAG ACA GTG CTG CTC COT GGC AAG GTG ATC GTC 1305 
Lya Gly Val Pro Lys Thr Val Leu Leu Arg Gly Lys Val lie Val 

425 430 435 

GAC GAG GGA ACC TAG GTG GGG GCG CCG ACG GAT GGC CAG TTC CGG 1350 
Asp Glu Gly Thr Tyr Val Gly Ala Pro Thr Asp Gly Gin Phe Arg 

440 445 450 

AAG CGC CGC AAA TAC AAG CAA TAA 1373 
Lya Arg Arg Lys Tyr Lys Gin STOP 

455 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: ENIRICERCHE S-p,A. 

(B) STREET: VIA MARITANO, 26 

(C) CITY: SAN DONATO MILANESE, (MILANO) 

(E) COUNTRY: ITALY 

(F) POSTAL CODE: 20097 
TELEFAX: 02/52036344 

(ii) TITLE INVENTION: 

PROCESS FOR THE PRODUCTION OF D-N-ALFA AMINO ACIDS 

(iii) NUMBER OF SEQUENCES: 19 

(iv) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Diskette 

(iv) COMPUTER: IBM PC-Compatible 
OPERATING SYSTEM: IBM-DOS 5-2/WINDOWS 3-1 
SOFTWARE :DisplayWrite 4 
(1) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 2 3 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 

CGAATTGTAA ATTATGCAGC AGC 23 
(1) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 23 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
AGGATCGTGA ACTACGCGGC GGC 23 
(1) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGHT: 23 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA {Genomic) 
(xi) SEQUENCE DESCRIPTION: 
CGCATAGTCA ATTATGCCGC CGC 
(1) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 23 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
CGTATTGTTA ATTATGCTGC TGC 

(1) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
GGACCAATTC AACGAGC 

(1) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
GGGCCGATCC AGCGGGC 

(1) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGHT: 17 base pairs 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(Xi) SEQUENCE DESCRIPTION: 
GGCCCCATAC AACGCGC 
(1) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
GGTCCTATTC AACGTGC 

(1) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
CGAGCAGATG TAATGGA 

(1) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
AGGGCGGACG TGATGGA 

(1) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
CGCGCCGATG TCATGGA 
(1) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 17 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
CGTGCTGATG TTATGGA 

(1) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 27 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(xi) SEQUENCE DESCRIPTION: 
ATCGTAACCG CGGACGGGAT TTCTCCC 

(1) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 22 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(ix) FEATURE: 

(A) NAME: Primer 

(xi) SEQUENCE DESCRIPTION: 

GGGAATTCTT ATGACACGTC AG 

(1) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 23 base pairs 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 

(ix) FEATURE: 

(A) NAME: Primer 

(xi) SEQUENCE DESCRIPTION: 

CCCAAGCTTC AAAATTCCGC GAT 23 

(1) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 13 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(ix) FEATURE: 

(A) NAME: Linker 

(xi) SEQUENCE DESCRIPTION: 

AATTCTTATG GAT 13 

(1) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 76 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(ix) FEATURE: 

(A) NAME: Linker 

(xi) SEQUENCE DESCRIPTION: 

TCGACGAGGG AACCTACGTG GGGGCGCCGA CGGATGGCCA 
GTTCCGGAAG CGCCGCAAAT ACAAGCAATA AGGAGG 
(1) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGHT: 915 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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15 



(ii) MOLECULE TYPE: DNA (Genomic) 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aarobacterium radiobacter 
(xi) SEQUENCE DESCRIPTION: 

ATG AC A CGT CAG ATG ATA CTT GCT GTC GGA CAG CAA GGC CCC ATC 45 
Met Thr Arg Gin Met He Leu Ala Val Gly Gin Gin Gly Pro He 

5 10 15 

GCG CGA GCG GAG ACA CGC GAA CAG GTG GTT GGC CGC CTC CTC GAC 90 
Ala Arg Ala Glu Thr Arg Glu Gin Val Val Gly Arg Leu Leu Asp 

20 25 30 

ATG TTG ACG AAC GCA GCC AGC CGG GGC GTG AAC TTC ATC GTC TTT 135 
Met Leu Thr Asn Ala Ala Ser Arg Gly Val Asn Phe He Val Phe 
35 40 45 

20 CCC GAG CTT GCG CTC ACG ACC TTC TTC CCG CGC TGG CAT TTC ACC 180 
Pro Glu Leu Ala Leu Thr Thr Phe Phe Pro Arg Trp His Phe Thr 
50 55 60 

GAC GAG GCC GAG CTC GAT AGC TTC TAT GAG ACC GAA ATG CCC GGC 225 
Asp Glu Ala Glu Leu Asp Ser Phe Tyr Glu Thr Glu Met Pro Gly 

65 70 75 

CCG GTG GTC CGT CCA CTC TTT GAG ACG GCC GCC GAA CTC GGG ATC 270 
Pro Val Val Arg Pro Leu Phe Glu Thr Ala Ala Glu Leu Gly He 

80 85 90 

GGC TTC AAT CTG GGC TAC GCC GAA CTC GTC GTC GAA GGC GGC GTC 315 
Gly Phe Asn Leu Gly Tyr Ala Glu Leu Val Val Glu Gly Gly Val 
95 100 105 

AAG CGT CGC TTC AAC ACG TCC ATT CTG GTG GAT AAG TCA GGC AAG 360 
Lys Arg Arg Phe Asn Thr Ser He Leu Val Asp Lys Ser Gly Lys 
110 115 120 

^ ATC GTC GGC AAG TAT CGT AAG ATC CAT TTG CCG GGT CAC AAG GAG 405 
He Val Gly Lys Tyr Arg Lys He His Leu Pro Gly Hys Lys Glu 
125 130 135 

TAC GAG GCC TAC CGG CCG TTC CAG CAT CTT GAA AAG CGT TAT TTC 450 

45 

Tyr Glu Ala Tyr Arg Pro Phe Gin His Leu Glu Lys Arg Tyr Phe 
140 145 150 

GAG CCG GGC GAT CTC GGC TTC CCG GTC TAT GAC GTC GAC GCC GCG 495 
50 Glu Pro Gly Asp Leu Gly Phe Pro Val Tyr Asp Val Asp Ala Ala 

155 160 165 
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20 



25 



AAA ATG GGG ATG TTC ATC TGC AAC GAT CGC CGC TGG CCT GAA ACG 540 
Lys Met Gly Met Phe He Cys Asn Asp Arg Arg Trp Pro Glu Thr 
170 175 180 

5 

TGG CGG GTG ATG GGA CTT AAG GGC GCC GAG ATC ATC TGC GGC GGC 585 
Trp Arg Val Met Gly Leu Lys Gly Ala Glu He He Cys Gly Gly 
185 190 195 

70 TAG AAC ACG CCG AGO CAC AAT CCC CCC GTT CCC GAG CAC GAC CAT 630 

Tyr Asn Thr Pro Thr His Asn Pro Pro Val Pro Gin His Asp His 
200 205 210 

CTG ACG TCC TTC CAC CAC CTT CTG TCG ATG CAG GCC GGG TCG TAG 675 
Leu Thr Ser Phe His His Leu Leu Ser Met Gin Ala Gly Ser Tyr 
215 220 225 

CAA AAC GGC GCC TGG TCC GCG GGG GCC GGC AAG GTC GGC ATG GAG 720 
Gin Asn Gly Ala Trp Ser Ala Ala Ala Gly Lys Val Gly Met Glu 
230 235 240 

GAG GGG TGC ATG CTG CTC GGC CAT TCG TGC ATC GTG GCG CCG ACC 765 
Glu Gly Cys Met Leu Leu Gly His Ser Cys He Val Ala Pro Thr 
245 250 255 

GGC GAA ATC GTT GCC CTG ACC ACG ACG TTG GAA GAC GAG GTG ATC 810 
Gly Glu He Val Ala Leu Thr Thr Thr Leu Glu Asp Glu Val He 
260 265 270 

^0 ACC GCC GCC GTC GAT CTC GAC CGC TGC CGG GAA CTG CGC GAA CAC 855 

Thr Ala Ala Val Asp Leu Asp Arg Cys Arg Glu Leu Arg Glu His 
275 280 285 

ATC TTC AAT TTC AAA GCC CAT CGT CAG CCA CAG CAC TAG GGT CTG 900 
He Phe Asn Phe Lys Ala His Arg Gin Pro Gin His Tyr Gly Leu 
290 295 300 

ATC GCG GAA TTT TGA 
He Ala Glu Phe 
(1) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGHT: 1374 base pairs 

45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(vi) ORIGINAL SOURCE 
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(A) ORGANISM: Aqrobac terium radioba<;ter 
(xi) SEQUENCE DESCRIPTION: 

AT6 GAT ATC ATC ATC AAG AAC GGA ACC ATC 6TA ACC GCG GAC GGG 45 
Met Asp lie He He Lys Asn Gly Thr He Val Thr Ala Asp Gly 

5 10 15 

ATT TCT CCC GCC GAT CTC GGA ATC AAG GAT GGC AAG ATC GCC CAG 90 
He Ser Pro Ala Asp Leu Gly He Lys Asp Gly Lys He Ala Gin 

20 25 30 

ATC GGC GGA ACG TTC GGC CCG GCC GGC CGG AC A ATC GAC GCC TCC 135 
15 He Gly Gly Thr Phe Gly Pro Ala Gly Arg Thr He Asp Ala Ser 

35 40 45 

GGC CGC TAG GTT TTT CCG GGC GGC ATC GAC GTT CAT ACG CAT GTC 180 
Gly Arg Tyr Val Phe Pro Gly Gly He Asp Val His Thr His Val 

50 55 60 

GAG ACG GTC AGC TTC AAC ACG CAG TCG GCC GAC AC A TTC GCA ACC 225 
Glu Thr Val Ser Phe Asn Thr Gin Ser Ala Asp Thr Phe Ala Thr 

65 70 75 

GCG ACG GTC GCG GCC GCC TGT GGC GGC ACG ACG ACC ATC GTC GAT 270 
Ala Thr Val Ala Ala Ala Cys Gly Gly Thr Thr Thr He Val Asp 
80 85 90 

30 TTC TGC CAG CAG GAC CGC GGC CAT AGC CTG AGG GAG GCG GTC GCC 315 
Phe Cys Gin Gin Asp Arg Gly Hys Ser Leu Arg Glu Ala Val Ala 
95 100 105 

AAA TGG GAC GGC ATG GCC GGC GGC AAG TCG GCG ATC GAC TAG GGC 360 
^ Lys Trp Asp Gly Met Ala Gly Gly Lys Ser Ala He Asp Tyr Gly 

110 115 120 

TAG CAT ATC ATC GTG CTC GAT CCG ACT GAT AGC GTG ATC GAG GAG 405 
Tyr His He He Val Leu Asp Pro Thr Asp Ser Val He Glu Glu 
125 130 135 

CTA GAG GTA CTG CCA GAT CTC GGC ATC ACC TCC TTC AAG GTC TTC 450 
Leu Glu Val Leu Pro Asp Leu Gly He Thr Ser Phe Lys Val Phe 
140 145 150 

ATG GCT TAT CGC GGC ATG AAC ATG ATC GAC GAC GTG AC A CTG CTC 495 
Met Ala Tyr Arg Gly Met Asn Met He Asp Asp Val Thr Leu Leu 
155 160 165 

50 AGG ACG CTC GAC AAG GCC GCC AAG ACT GGG TCA CTC GTC ATG GTG 540 
Arg Thr Leu Asp Lys Ala Ala Lys Thr Gly Ser Leu Val Met Val 
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IS 



170 175 180 

CAC GCG GAG AAC GGC GAC GCC GCC GAC TAT CTT CGC GAC AAG TTC 585 
His Ala Glu Asn Gly Asp Ala Ala Asp Tyr Leu Arg Asp Lys Phe 
185 190 195 

GTC GCC GAT GGC AAA ACG GCG CCG ATC TAG CAC GCG CTC AGC CGT 630 
Val Ala Asp Gly Lys Thr Ala Pro lie Tyr His Ala Leu Ser Arg 
200 205 210 

CCG CCC CGG GTC GAA GCC GAG GCG ACC GCG AGG GCC CTC GCC CTG 675 
Pro Pro Arg Val Glu Ala Glu Ala Thr Ala Arg Ala Leu Ala Leu 
215 220 225 

GCG GAA ATC GTC AAC GCC CCG ATC TAC ATC GTG CAT CTG ACC TGC 720 
Ala Glu lie Val Asn Ala Pro lie Tyr He Val His Leu Thr Cys 
230 235 240 

20 GAA GAA TCC TTC GAC GAG TTG ATG CGG GCA AAG GCT CGG GGT GTC 765 

Glu Glu Ser Phe Asp Glu Leu Met Arg Ala Lys Ala Arg Gly Val 
245 250 255 

CAC GCC CTG GCC GAA ACC TGC AC A CAA TAC CTC TAC CTC ACC AAG 810 
His Ala Leu Ala Glu Thr Cys Thr Gin Tyr Leu Tyr Leu Thr Lys 
260 265 270 

GAC GAC CTG GAG CGG CCG GAT TTC GAG GGC GCG AAG TAT GTT TTC 855 
Asp Asp Leu Glu Arg Pro Asp Phe Glu Gly Ala Lys Tyr Val Phe 
275 280 285 

ACC CCG CCT CCG CGC ACG AAG AAG GAC CAG GAA ATC CTC TGG AAC 900 
Thr Pro Pro Pro Arg Thr Lys Lys Asp Gin Glu lie Leu Trp Asn 
290 295 300 

GCA CTC CGG AAC GGG GTC CTC GAA ACG GTC TCC TCG GAC CAT TGT 945 
Ala Leu Arg Asn Gly Val Leu Glu Thr Val Ser Ser Asp His Cys 
305 310 315 

^ TCC TGG CTC TTC GAG GGG CAC AAG GAT CGG GGC AGG AAC GAC TTC 990 

Ser Trp Leu Phe Glu Gly His Lys Asp Arg Gly Arg Asn Asp Phe 
320 325 330 

CGC GCC ATC CCG AAC GGA GCG CCG GGC GTC GAG GAG CGG CTG ATG 1035 

45 

Arg Ala He Pro Asn Gly Ala Pro Gly Val Glu Glu Arg Leu Met 
335 340 345 

ATG GTC TAT CAG GGC GTC AAC GAA GGC CGC ATT TCC CTC ACC CAG 1080 
50 Met Val Tyr Gin Gly Val Asn Glu Gly Arg He Ser Leu Thr Gin 

350 355 360 
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TTC GTA GAA CTG GTC GCC ACG CGC CCG GCC A AG GTC TTC GGC ATG 1125 
Phe Val Glu Leu Val Ala Thr Arg Pro Ala Lys Val Phe Gly Met 
365 370 375 

TTC CCG GAA AAA GGA ACG GTC GCG GTC GGT TCG GAT GCC GAC ATC 1170 
Phe Pro Glu Lys Gly Thr Val Ala Val Gly Ser Asp Ala Asp lie 
380 385 390 

GTC CTT TGG GAT CCC GAG GCT GAA ATG GTG ATC GAA CAA AGC GCC 1215 
Val Leu Trp Asp Pro Glu Ala Glu Met Val lie Glu Gin Ser Ala 
395 400 405 

ATG CAT AAC GCC ATG GAT TAC TCC TCC TAC GAG GGA CAC AAG ATC 1260 
Met His Asn Ala Met Asp Tyr Ser Ser Tyr Glu Gly His Lys lie 
410 415 420 

AAG GGC GTG CCG AAG ACA GTG CTG CTG CGT GGC AAG GTG ATC GTC 1305 
20 Lys Gly Val Pro Lys Thr Val Leu Leu Arg Gly Lys Val lie Val 

425 430 435 

GAC GAG GGA ACC TAC GTG GGG GCG CCG ACG GAT GGC CAG TTC CGG 1350 
Asp Glu Gly Thr Tyr Val Gly Ala Pro Thr Asp Gly Gin Phe Arg 
440 445 450 

AAG CGC CGC AAA TAC AAG CAA TAA 1374 
Lys Arg Arg Lys Tyr Lys Gin 
30 455 
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Claims 

35 

1. A process for the production of D-xi-amino acids by the stereospecific conversion of racemic mixtures 
of 5-substituted hydantoins characterized in that, the conversion reaction is carried out in the presence 
of a microorganism transformed with the plasmid pSM651 CBS 203.94 capable of expressing at high 
levels and without Inducers an enzymatic system capable of converting said hydantoins into the 

40 corresponding l>a-amtno acids. 

2. The process according to claim 1. characterized in that, the conversion reaction is carried out in the 
presence of the enzymatic system isolated from a microorganism transformed with the plasmid 
pSM651 CBS 203.94. 

45 

3. The process according to claim 2. characterized in that, said enzymatic system Is immobilized on an 
insoluble support. 

4. The process according to claim 1, characterized in tiiat, the microorganisms are selected from the 
50 group of Bacillus sutrtilis and Escherichia coli . 

5. The process according to claim 1, characterized in that, tt)e 5-substituted hydantoin is selected from 
D,L-5-phenylhydantoln, D,L-5-para-hydroxyphenylhydantoin, D,L-5-mettiylhydantoin, D,L-5-isopropyl- 
hydantoin, D,L-5-thienylhydantoin, D,L-5-para-methoxyphenylhydantoin, D,L-5-para-chloro phenylhydan- 

55 toin, D,L-5-benzylhydarrtoin. 

6. The process according to claim 5, characterized in that, the hydantoin is D,L-5-para-hydroxyphenyl- 
hydantoin. 
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7. The process according to claim 5, characterized in that, the hydantoin is D,L-5-phenylhydantoin. 

& The process according to claim 1, characterized In that, the conversion reaction is carried out at a 
teniperature of between 20 • C and 60 " C. 

9. The process according to claim 8, characterized in that, the temperature is ttetween 30 * and 45 * C. 

10. The process according to claim 1, characterized in that, the conversion reaction is carried out at a pH 
of between 6.0 and 10. 

11. The process according to claim 10. characterized in that, the pH is between 7.0 and 8.5. 

12. The process according to claim 1, characterized In that, the conversion reaction is carried out using a 
weight ratio biomass/hydantoins of t)etween 1/1 and 1/50. 

ia Plasmid pSM651 deposited at the Bureau Voor SchimmelcuKures. SK Baam (Holland) where it has 
received the deposit numt>er CBS 203.94. 

14. A microorganism selected from Bacillus subtilis and Escherichia coll transformed with the plasmid 
20 pSM651. 

15. The microorganism according to claim 10, which is Escherichia coli SMC305 CBS 203.94. 
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FIG. 4A 



ATG ACA 
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GCG CGA 
Ala hrg 

ATG TTG 
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COC GAG 
Pro GIu 

GAC GAG 
Asp Glu 

CCG GTG 
Pro Val 

GGC TTC 
Gly Pho 

AAG OCT 
Lys Arg 

ATC GTC 
He Val 

TAG GAG 
Tyr Glu 



CGI CAG ATG 
Arg Gin Net 
5 

GCG GAG ACA 
Ala Glu Thr 
20 

ACG AAC GCA 
Thr Aan Ala 

35 

CTT GOG CTC 
Leu Ala Leu 
50 

GCG CAG CTC 
Ala Glu Leu 
65 

CTC C6T CCA 
Val Arg Pro 
80 

AAT CTC GOC 
Asn L*u Gly 
9S 

GGC TTC AAC 
Arg Phe Asn 
110 

OOC AM? TAT 
Gly Lys Tyr 

125 

GCC TAC COG 
Ala Tyr Arg 
140 



ATA CTT GOT 
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COC GAA CAG 
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GCC AGC CGG 
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ACG ACC TTC 
Thr Thr Pha 

GAT AGC TTC 
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CTC TTT GAG 
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TAC GCC GAA 
Tyr Ala Glu 

ACG TCC ATT 
Thr Sar I la 

COT AAG ATC 
Arg Lys I la 

CCG TTC CAG 
Pro Pha Gin 



CTC GGA 
Val Gly 

10 
GTG GTT 
Val Val 

25 
GGC GTG 
Gly Val 

40 
TTC CCG 
Phe Pro 

55 
TAT GAG 
Tyr Glu 

70 
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Thr Ala 

85 
CTC GTC 
Lau Val 

100 
CTG GTG 
Lau Val 

115 
CAT TTG 
His Lau 

130 
CAT CTT 
His Leu 
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Asp 

CCG 
Pro 
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Glu 



CAA GGC 
Gin Gly 

COC CTC 
Arg Lau 

TTC ATC 
Pha Ila 

TOG CAT 
Trp His 

GAA ATG 
Glu Nat 

GAA CTC 
Glu Lau 
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AAG TCA 
Lys Sar 

OCT CAC 
Gly Bys 
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CCC ATC 
Pro Ila 

15 
CTC GAC 
Lau Asp 

30 
GTC TTT 
Val Pha 

45 
TTC AOC 
Pha Thr 

60 
CCC GGC 
Pro Gly 

75 
GOG ATC 
Gly Ila 
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GGC GTC 
Gly Val 
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GOC AAG 
Gly Lys 
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AAG GAG 
Lys Glu 
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TAT TTC 
Tyr Pha 

150 
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FIG. 4B 

GAG CCG GGC CAT CTC GGC TTC CCG GTC TAT GAC GTC GAG GCC GCC 495 
Glu Pro Gly Asp Leu Gly Phe Fro Val Tyr Asp Val Asp Ala Ala 

155 160 165 

AAA ATG GGG ATG TTC ATC TGC AAC GAT CGC GGC TCG OCT GAA ACG 540 
Lys Nat Gly Net Phe He Cys Asn Asp Arg Arg Trp Pro Glu Thr 

170 175 180 

TGG CGG GTG ATG GGA CTT AAG GGC GCC GAG ATC ATC TGC GGC GGC 585 
Trp Arg Val Met Gly Leu Lys Gly Ala Glu He He Cys Gly Gly 

185 190 195 

TAG AAC ACG CCG ACC CAC AAT CCC GCC GTT COC CAG CAC CAC CAT 630 
Tyr Asn Thr Pro Thr Hie Asn Pro Pro Val Pro Gin His Asp His 

200 205 210 

CTG ACG TCC TTC CAC CAC CTT CTG TCG ATG CAG GCC GGG TCG TAC 675 
Leu Thr Ser Phe His His Leu Leu Ser Met Gin Ala Gly Ser Tyr 

215 220 225 

CAA AAC GGC GCC TGG TCC GCG GCG GCC GGC AAG GIC GGC ATG GAG 720 
Gin Asn Gly Ala Trp Ser Ala Ala Ala Gly Lys Val Gly Met Glu 

230 235 240 

GAG GOG TGC ATG CTG CTC GGC CAT TCG TGC ATC CTG GCG CCG ACC 765 
Glu Gly Cys Net Leu Leu Gly His Ser Cys He Val Ala Pro Thr 

245 250 255 

GGC GAA ATC GTT GCC CTG ACC ACG ACG TTC GAA GAC GAG GIG ATC 810 
Gly Glu He Val Ala Leu Thr Thr Thr Leu Glu Asp Glu Val He 

260 265 270 

ACC GCC GCC GTC GAT CTC CAC CGC TGC CGG GAA CTG CGC GAA CAC 855 
Thr Ala Ala Val Asp Leu Asp Arg Cys Arg Glu Leu Arg Glu His 

275 280 285 

ATC TTC AAT TTC AAA GCC CAT CGT CAG CCA CAG CAC TAC CGT CTG 900 
He Phe Asn Phe Lys Ala His Arg Gin Pro Gin His Tyr Gly Leu 

290 295 300 

ATC GCG GAA TTT TGA 915 
He Ala Glu Phe STOP 
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FIG. 5A 



ATG ChT 
Met Asp 

ATT TCT 
lie Ser 

ATC OOC 
He Gly 

OOC CGC 
Gly Arg 

GAG ACG 
Glu Thr 

GCG ACG 
Ala Thr 

TTC TOC 
Thm Cys 

AAA TGG 
Lys Trp 



AIC ATC 
lie Il« 

CCC GCC 
Pro AU 

GGA ACG 
Gly Thr 

TAG GTT 
Tyr Val 

GTC AGC 
Val Sar 

GTC GCG 
Val Ala 

CAG CAG 
Gin Gin 

GAC GGC 
Asp Gly 



TAC CAT 
Tyr His 

CTA GAG 
Leu Glu 



ATC ATC 
lis Ila 

GTA CTG 
Val Leu 



ATC AAG AAC 
lie Lys Asn 

5 

GAT CIC GGA 
Asp Leu Gly 
20 

TTC GGC CCG 
Fhe Gly Pro 
35 

TTT CCG GGC 
Phe Pro Gly 
50 

TTC AAC ACG 
Phe Asn Thr 
65 

GCC GCC TGT 
Ala Ala Cys 
80 

GAC CGC GGC 
Asp Arg Gly 

95 

ATG GCC GGC 
Met Ala Gly 
110 

GTG CTC GAT 
Val Leu Asp 

125 

CCA GAT CTC 
Pro Asp Leu 



GGA ACC ATC 
Gly Thr lie 
10 

ATC AAG GAT 
lie Lys Asp 
25 

GCC Goc cxx; 

Ala Gly Arg 
40 

GGC ATC GAC 
Gly lie Asp 
55 

CAG TCG GCC 
Gin Ser Ala 

70 

GGC GGC ACG 
Gly Gly Thr 
85 

CAT AGC CTG 
Rys Ser Leu 

100 

GGC AAG TCG 
Gly Lys Ser 
115 

CCG ACT GAT 
Pro Thr Asp 

130 

GGC ATC ACC 
Gly He Thr 



GTA ACC GCG GAC GGG 45 
Val Thr Ala Asp Gly 
15 

GGC AAG ATC GCC CAG 90 
Gly Lys He Ala Gin 
30 

ACA ATC GAC GCC TCC 135 
Thr He Asp Ala Ser 
45 

GTT CAT ACG CAT GTC 180 
Val His Thr His Val 
60 

GAC ACA TTC GCA AOC 225 
Asp Thr Phe Ala Thr 

75 

ACG ACC ATC GTC GAT 270 
Thr Thr He Val Asp 
90 

AGO GAG GCG GTC GCC 315 
Arg Glu Ala Val Ala 
105 

GCG ATC GAC TAC GGC 360 
Ala He Asp Tyr Gly 
120 

AGC GTG ATC GAG GAG 405 
Ser Val He Glu Glu 

135 

TCC TTC AAG GTC TTC 450 
Ser Phe Lys Val Phe 
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FIG. 5B 
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GGC ATG AAC 
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AAG GCC OCC 
Lya Ala Ala 

170 

GGC GAC GCC 
Gly Aap Ala 
185 

AAA ACG GCG 
Lya Thr Ala 
200 

GAA GCC GAG 
Glu Ala Glu 
215 

AAC GCC CCG 
Asn Ala Pro 
230 

GAC GAG TTG 
Aap Glu Lau 
245 

GAA ACC TGC 
Glu Thr Cya 
260 

CGG COG GAT 
Arg Pro Asp 

275 

CGC ACG AAG 
Arg Thr Lya 
290 

GGG GTC CTC 
Gly Val Lau 



ATG ATC 
Nat I la 

AAG ACT 
Lya Thr 

OCC GAC 
Ala Asp 

CCG ATC 
Pro I la 

OCG AOC 
Ala Thr 

ATC TAC 
I la Tyr 

ATG CGG 
Nat Arg 

ACA CAA 
Thr Gin 

TTC GAG 
Pha Glu 
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GAC GAC 
Asp Aap 
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GGG TCA 
Gly Sar 
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TAT CIT 
Tyr Lau 
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TAC CAC 
Tyr His 
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GCG AGO 
Ala Arg 

220 

ATC GTG 
I la Val 
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OCA AAG 
Ala Lys 

250 

TAC CTC 
Tyr Lau 
265 

GGC GCG 
Gly Ala 
280 

CAG GAA 

Gin Glu 
295 

GTC TCC 
Val Sar 



GTG ACA 
Val Thr 

CTC GTC 
Lau Val 

COC GAC 
Arg Asp 

GCG CTC 
Ala Lau 

GCC CTC 
Ala Lau 

CAT CTG 
His Lau 

GCT CGG 
Ala Arg 

TAC CTC 
Tyr Lau 

AAG TAT 
Lya Tyr 

ATC CTC 
He Lau 

TCC GAC 
Sar Aap 
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CTG CTC 
Leu Lau 
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ATG GTG 
Met Val 
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AAG TTC 
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AGC CGT 
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FIGr 5C 

305 310 315 

TCC TGG CTC TTC GAG OGG CAC AAG GAT COG GGC AGG AAC GAG TTC 990 
Ser Trp Leu Phe Glu Gly His Lys Asp Arg Gly Arg Asn Asp Phe 
320 325 330 

CGC GGC ATC CCG AAC GGA GOG COG GGC GTC GAG GAG CGG CTG ATG 1035 
Arg Ala lie Pro Asn Gly Ala Pro Gly V«l Glu Glu Arg Leu Met 
335 340 345 

hTG GTC TAT CAG OOC GTC AAC GAA GGC CGC ATT TOC CTC AOC CAG 1060 
Met Val Tyr Gin Gly Val Asn Glu Gly Arg He Ser Leu Thr Gin 
350 355 360 

TTC GTA GAA CTG GTC GGC ACG CGC CCG GCC AAG GTC TTC GGC ATG 1125 
Phe Val Glu Leu Val Ala Thr Arg Pro Ala Lys Val Phe Gly Met 
365 370 375 

TTC CCG GAA AAA GGA AOGCTCOCCGTCCCTTCGCATGCCGACATC 1170 
Phe Pro Glu Lys Gly Thr Val Ala Val Gly Ser Asp Ala Asp He 
380 385 390 

GTC CTT TGG GAT CCC GAG GCT GAA ATG GTC ATC GAA GAA AGC GCC 1215 
Val Leu Trp Asp Pro Glu Ala Glu Met Val lie Glu Gin Ser Ala 
395 400 405 

ATG CAT AAC GCC ATG GAT TAG TCC TCC TAC GAG GGA CAC AAG ATC 1260 
Met His Asn Ala Met Asp Tyr Ser Ser Tyr Glu Gly His Lys He 
410 415 420 

AAG GGC CTG CCG AAG ACA GTG CTG CTG CGT GGC AAG GTG ATC GTC 1305 
Lys Gly Val Pro Lys Thr Val Leu Leu Arg Gly Lys Val He Val 
425 430 435 

GAG GAG GGA ACC TAC GTG OGG GCG CCG ACG GAT GGC CAG TTC CGG 1350 
Asp Glu Gly Thr Tyr Val Gly Ala Pro Thr Asp Gly Gin Phe Arg 
440 445 450 

AAG CGC CGC AAA TAC AAG CAA TAA 1373 
Lys Arg Arg Lys Tyr Lys Gin STOP 
455 
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